Skip to content

Conversation

@lucasb-eyer
Copy link
Member

Adds a model "zoo" à la Caffe. Currently, this contains VGG16 and VGG19, it allows to just create the model (and then train yourself) or download the pretrained ILSVRC weights from the internet and create a pretrained version of the net. The latter is done in the example.

Again, I made the commits so that it's best to read the commits individually, the commit message contains a lot of information.

The two large json files will move from the example to the data submodule, whose PR will follow in a few days.

Looking forward to your feedback; especially, I think you may not like the change of parameters to SpatialConvolutionCUDNN?

@lucasb-eyer
Copy link
Member Author

Also, this needed the SpatialSoftMaxCUDNN which you introduced in #13 but isn't merged yet, so i just put it in here and slightly adapted it.

@ikostrikov
Copy link
Member

I think that we should remove json files from github.

- The `mode` now defaults to `cross`, which is what all other packages
  use and thus it allows loading their weights without needing to fiddle
  with mirroring/flipping them correctly.
- Adds an alternative `border` parameter for these two reasons:
    1. So that we have a flexible shortcut for 'same'
    2. So that Theano can use some tricks for 'valid' and 'full'
  The old interface is still available and used since `border=None` by default.
This change is fully backwards compatible.
(I need it for ILSVRC-pretrained VGG)
@lucasb-eyer
Copy link
Member Author

Yeah, I felt uncomfortable adding them, but for some reason didn't think of putting them into a github gist and using the download function included in this very PR. That's what I've done now.

I've also undone the change to the border parameter of SpatialConvolutionCUDNN so that it's back to the same as Torch7 by default, and the string version is just an alternative, as per our discussion. Also undone the change to Module.parameters() for the same reason.

This includes implementing a slightly fancier download functionality
which caches the data in a `~/.cache/beacon8` folder.

For the example code, this also necessitated writing `imread` and
`imresize` functions which use either `cv2` or `PIL` depending on what's
available.
@ikostrikov
Copy link
Member

Interface is still different from Torch7.

@lucasb-eyer lucasb-eyer deleted the pretrained branch November 23, 2015 22:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants